Chapter 7

Whole Genome Pattern Discovery

A sequence, whether DNA or protein, is the basic component of most genomics research. It is well understood that a sequence is the main carrier of genetic information for any species. There is also no doubt that genomics research can hardly be successful without looking into sequence constituents. Importantly, most novel and significant discoveries in biology or medicine are nowadays based on sequencing data analysis. This chapter introduces the commonly used sequence analysis approaches, from the basic ones to the advanced ones, and mainly focuses on the sequence comparison approaches for whole genome pattern discovery. Moreover, this chapter shows how the sequence comparison approaches can be used to analyse the SARS-CoV-2 pandemic data.

SARS-CoV-2 pandemic

The COVID-19 (Coronavirus Disease 2019) pandemic caused by SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) has remained a serious situation worldwide since it first emerged in Wuhan, China, in December 2019 [Zhou, et al., 2020]. As of 30 March 2021, there had been more than 128.6 million infections and more than 2.8 million deaths worldwide, as reported on the Worldometer webpage. By 4 January, [...] had collected 315,253 SARS-CoV-2 genome sequences from almost all countries in the world. Based on these huge volumes of data, many questions are waiting for answers